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@ Human transforming growth factor and precursor or fragment thereof, cells, DNA, vectors and methods for their 
production, compositions and products containing them, and related antibodies and diagnostic methods. 

© Recombinant DNA technology is used to produce the 
transforming growth factor-a (TGF-a) precursor and its 
fragments. TGF-a precursor and its fragments, including 
particularly mature TGF-a, and optionally in combination 
with human transforming growth factor-f3, has utility for 
example in the therapeutic treatment of human beings for 
bone diseases and to accelerate wound healing, in addition, 
TGF-a is useful as an adjuvant for cell culture so as to reduce 
requirements for serum in the culture media {thereby 
resulting in purification advantages), and to stimulate or 
enhance ceil growth in cell culture. Also, preparation of large 
quantities of the TGF-a precursor and its fragments enables 
the preparation of reagents for the assay of the TGF-a 
precursor and its fragments in body fluids for the diagnosis 
of neoplastic or other diseases. 
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HUMAN TRANSFORMING GROWTH FACTOR AND PRECURSOR 
OR FRAGMENT THEREOF , CELLS, DNA, VECTORS AND 
METHODS FOR THEIR PRODUCTION , COMPOSITIONS 
AND PRODUCTS CONTAINING THEM, AND RELATED 
ANTIBODIES AND DIAGNOSTIC METHODS 



The present invention relates to human precursor trans- 
forming growth factor-a and its fragments, notably mature human 
transforming growth factor-a (TGF-a), corresponding to that found in 
human tissues and to novel forms and compositions thereof and 
methods for production to homogeneity in therapeutically and/or 
diagnostically significant quantities. 

The present invention enables the production of sufficient 
quantities of high purity material in comparison to the isolation 
methods previously employed involving production and extraction from 
existing cell cultures, which naturally include undesired proteins 
and which are only available in extremely small quantities. 

The publications and other materials cited herein are 

incorporated herein by reference and, for convenience, generally are 
numerically referenced in the following text and respectively 
groupedVin the appended bibliography. 
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Transforming growth factors {TGFs) are factors which can 
elicit a phenotypical transformation of normal cells in a reversible 
way. It has been shown that administration of TGFs apparently 
stimulates normal cells to undergo uncontrolled growth of the cells 

5 and promotes anchorage independence as measured by formation of 

transformed cell colonies in soft agar (1-3). Two classes of TGFs 
have been distinguished. TGF-a is secreted by a wide variety of 
tumor cells from human or rodent origin (4-7). TGF-a and epidermal 
growth factor (EGF) are reported to compete for the same receptor 

1C (5, 8, 9), which is phosphorylated at tyrosine residues after the 
binding of TGF-a or EGF (10-11). Some evidence has been presented 
for the presence of another receptor, specific for TGF-a (12). The 
anchorage independent growth triggered by TGF-a is strongly 
potentiated by TGF-b (13, 14). This latter TGF has been detected in 

15 many normal and tumor cells (13-17) and has been purified from 
kidneys (18), placenta (19) and platelets (20). TGF-b is not 
believed to bind to the EGF receptor and is believed to require EGF 
or TGF-a for its transforming activity or NRK cells (13-17). 

20 The biological role of the TGFs has not been clearly 

elucidated. Many studies suggest that TGFs may play an important 
role in the transformation event. It has been shown that cellular 
transformation with retroviruses (21-24), SV40 (25) or polyoma virus 
(26) results in the secretion of TGF-a. The tight linkage of TGF 

25 secretion to cellular transformation has been indicated by 

transfection experiments with polyoma virus DMA, which show that 
introduction of the DNA segment for middle T antigen is needed and 
sufficient to trigger both the transformed pheno type and the TGF 
secretion (26), Transformation studies with a temperature-sensitive 

;- : mutant of Kirsten murine sarcoma virus also indicate that TGF-a is 
secreted only when phenotypic transformation occurs- at the 
permissive temperature (21). In addition, recent studies indicate 
that introduction of the cloned T24 bladder oncogene induces TGF 
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production (27). The biological relevance in tumor development is 
also suggested by the presence of activity identified as that of 
TGF-a in the urine of cancer patients, in contrast to the normal 
controls (28-30). The assay used, however, would not distinguish 

S tiie activity of other growth factors such as EGF. These and other 
observations suggest that TGF- a may play a role in tumor formation 
via an autocrine mechanism, by which the TGFs are secreted by the 
transformed cells and maintain and stimulate this transformed 
character of the same cell population (31-32). However, the 

10 potentiating effect of TGF-b may be, neede^ as ^Uflji^ted ty /the 

secretion of both TGF'-a 'and' -b by tunior cetlV (14)1* Tn this' way, 
TGF-a may be a very potent effector molecule during malignant 
transformation. 

15 Heterogeneous molecular weights for TGF-a are reported for 

extracts and supernatants from tumor cells (5, 12, 22, 33-34) and in 
urine (28-30). A small species of about 7 kilodaltons has been 
purified from both rodent (27, 35) and human (34, 35) cell sources. 
Amino acid sequence analysis of this rat and mouse TGF-a shows some 

20 homology with EGF (27, 35), The reported partial polypeptide 

sequence of the human TGF-a shows a strong homology with the rat and 
murine species (35). 

It has been observed that patients with metastasized renal 
25 cell carcinoma can develop a progressive decalcification of the 

bone, which is reflected in a humoral hypercalcemia (36). A recent 
study using impure protein preparations suggests that transforming 
growth factors (including TGF-a) may cause bone resorption in a 
tissue culture system (37). 

30 

TGF-a can only be produced in such limited quantity by 
virus transformation of cells as to be impractical for 
aforementioned use, as a.therapeutic or diagnostic reagent. 



35 
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Recombinant DMA Technology 

Heterologous proteins, i.e., proteins not normally produced 
by a cell, are synthesized by cells that have been transformed by 

5 exogenous DNA. This typically is accomplished by introducing 

foreign DNA into a cell in the form of a vector. DMA recombination 
of the elements of a vector, i.e., an origin of replication, one or 
more phenotypic selection characteristics, an expression promoter, 
heterologous gene insert and remainder of the vector, generally is 

10 performed outside the host cell. The resulting recombinant 

replicable expression vehicle, or plasmid, is introduced into cells 
by transformation and large quantities of the recombinant vehicle 
obtained by growing the transformant. Where the gene is properly 
inserted with reference to portions which govern the transcription 

15 and translation of the encoded DNA message, the resulting expression 
vehicle is useful to actually produce the polypeptide sequence for 
which the inserted gene codes, a process referred to as expression. 
The resulting product may be obtained from intracellular locations 
by lysing the host cell, or from culture media in the case of 

20 secreted products and thereafter recovered by appropriate 
purification from contaminant proteins. 



Summary of the Invention 



25 The present invention is based upon the discovery that 

recombinant DNA technology can be used successfully to produce the 
transforming growth factor-a (TGF-a) precursor and its fragments in 
amounts sufficient to initiate and conduct animal and clinical 
testing as prerequisites to market approval. TGF-a precursor and 

30 its fragments, including particularly mature TGF-a, and optionally 
in combination with human transforming growth factor-s, has utility 
for example in the therapeutic treatment of human beings for bone 
diseases and to accelerate wound healing. In addition, TGF-a is 
useful as an adjuvant for cell culture so as to reduce requirements 
***^g^*tt^mt-'in the culture- metfta i tftto-^>«ul tfrig l i n purification 
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advantages), and to stimulate or enhance cell growth in cell 
culture. Also, preparation of large quantities of the TGF-a 
precursor and its fragments enables the preparation of reagents for 
the assay of the TGF-a precursor and its fragments in body fluids 
5 for the diagnosis of neoplastic or other diseases. 

The complete nucleotide and imputed amino acid sequence for 
the TGF-a precursor is depicted in Fig. 3. For convenience in 
identifying the various principal domains of the precursor we have 

4,0 designated three polypeptides, mature TGF-a (depicted as the boxed 
sequence in Fig. 3 at residues 40-89), the TGF-a C-terminal polypep- 
tide at residues 90-160 (principally the 62 residue polypeptide from 
Gin 98 to Val 160), designated herein as TGF-aC, and the TGF-a 
N-terminal polypeptide from Met 1 to Ala 22 (TGF-aN). As such 

15 therefore, the TGF-a precursor is a TGF-a bearing polypeptide 

including mature TGF-a and is in effect a fusion of TGF-a with its 
normal flanking sequences. Also, for convenience, the TGF-a pre- 
cursor and its fragments (including mature TGF-a, TGF-aC and TGF-aN) 
collectively will be referred to herein as PRTGF-a species. The 

20 term TGF-a shall mean mature TGF-a. Also, unless otherwise stated 
the term PRTGF-a species shall be deemed to include PRTGF-a species 
derivatives such as amino acid sequence mutants as are more fully 
described infra . Another PRTGF-a species fragment included herein 
is composed of TGF-aC and mature TGF-a. This polypeptide, desig- 

25 nated TGF-omC and extending from residues 20-160, is believed to be 
the an early product of precursor TGF-a expression in mammalian 
cells since the TGF-aN sequence is believed to only function as a 
signal which would be cleaved from TGF-omC upon processing by the 
endoplasmic reticulum. These principal domains may not be found iji 

30 vivo with precisely the designated amino or carboxyl termini as some 
variation in cellular processing is to be expected. 

„P.RTGF-a, -fragments generally (a) exhibit some Mological 
activity in common with PRTGF-a, e.g., antibody cross-reactivity or 
35 induction of anchorage independence, (b) exhibit substantial homology 
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with some region in PRTGF-a and (c) are at least about 5 amino acid 
residues long, and are ordinarily about from 10 to 130 residues in 
length. 

5 The invention provides methods for assaying such heretofore 

unidentified polypeptide fragments of PRTGF-a as TGF-aN and TGF-aC, 
and provides methods for assaying same without interference from 
other peptides encompassing or containing part of these sequences. 

10 Since the present invention now makes the complete amino 

acid sequence of PRTGF-a species known it is possible for the first 
time to raise antibodies against predetermined amino acid sequences 
of PRTGF-a species. Amino acid sequences representing PRTGF-a 
species fragments (other than TGF-a) are linked in immunogenic 

15 conjugates to proteins and then used to immunize animals. Such 

antibodies are useful in specific immunoassays for PRTGF-a species 
and in passive immunotherapy. 

The present invention further comprises essentially pure 
20 PRTGF-a species in which the PRTGF-a species are free of 

contaminants with which the are ordinarily associated in the 
non-recombinant cellular environment. Such contaminants are those 
which are normally present with the TGF-a as found in nature, e.g. 
in cells, cell exudates or body fluids, and include human serum 
25 albumin, gansna globulin, lipoproteins, and growth factors such as 

epidermal growth factor (EGF). Other proteins from the source from 
which the PRTGF-a species DMA is obtained may be present in the 
PRTGF-a species compositions herein, e.g. TGF-e and plate! et-deriyed 
growth factor (PDGF), but here they will be known and present in 
3Q predetermined amounts. For example, recombinant cell culture in 
non-human cells enables the production of human TGF-a which are 
absolutely free of other human proteins. 

Tne present invention is also directed to replicable DNA 
35 expression vehicles harboring DMA encoding PRTGF-a species in 
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expressible form, to microorganism strains or cell cultures 
transformed with such DNA and to microbial or cell cultures of such 
transformed strains or cultures, capable of producing PRTGF-a 
species* Still further, this invention is directed to methods for 
5 recombinant fermentative synthesis of PRTGF-a species in said 
microorganisms and cell cultures. 

DNA is provided that encodes PRTGF-a species and which, 
when expressed in recombinant or transformed culture, yields copious 
10 quantities of such PRTGF-a species- This DNA is novel because cDNA 
obtained by reverse transcription of mRNA from PRTGF-a species 
synthesizing cells contains no introns and is free of any flanking 
regions encoding other proteins homologous to the source of the DNA. 

15 Chromosomal DNA encoding PRTGF-a species is obtained by 

probing genomic DNA libraries with cDNA encoding PRTGF-a species. 
Chromosomal DNA is obtained free of its normal chromosomal 
environment and is therefore free of regions flanking the upstream 
5 1 end of the first exon or flanking the 3* downstream end of the 

20 last exon which encodes other proteins homologous to the genomic DNA 
source but will contain introns since the genomic coding sequence 
for human precursor TGF-a contained in six distinct exons. Such DNA 
is useful for transforming mammalian cells to synthesize PRTGF-a 
species as is further described herein. 

25 

The isolated PRTGF-a species DNA is readily modified by 
substitution, deletion or insertion of nucleotides, thereby 
resulting in novel DNA sequences encoding PRTGF-a species (in the 
case of nucleotide substitutions that do not change the encoded 
30 amino acid sequence) or its sequence mutants. Modified DNA 

sequences which do not change the amino acid sequence are useful in 
enhancing the efficiency of PRTGF-a species expression in chosen 
host-vector systems, .e.g.. where a human codon is-mutate-d to a codon 
preferred by an intended host cell. 

35 
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These novel DMA sequences or fragments thereof are labelled 
and used in hybridization assays for genetic material {DNA or mRNA) 
encoding PRTGF-a species. 

In processes for the synthesis of PRTGF-a species, DNA 
which encodes PRTGF-a species is ligated into a replicable 
( reproducible) vector, the vector used to transform host cells, the 
host cells cultured and PRTGF-a species recovered from the culture. 
The PRTGF-a species which are capable of synthesis herein include 
the TGF-a precursor, its fragments such as TGF-aC and TGF-a, and 
derivatives thereof including (a) fusion proteins wherein PRTGF-a 
species (including mature TGF-a) are linked to heterologous proteins 
or polypeptides by a peptide bond at the amino and/or carboxyl 
terminal amino acids of the TGF-a, (b) insertional or substitutional 
mutants of PRTGF-a species wherein one or more amino acid residues 
are mutated and (c) methionyl or modified methionyl, such as fonnyl 
methionyl or other blocked methionyl ami no-terminal addition 
derivatives of the foregoing fusions, fragments or mutants. 

Vectors which comprise DNA encoding PRTGF-a species 
operably ligated to a heterologous secretory leader sequence 
(usually a signal from a protein homologous to the intended host 
organism are used to transform host cells. Host cell processing of 
the resulting PRTGF-a species fusion results in secretion of the 
PRTGF-a species without ami no-terminal methionyl or blocked 
methionyl . 

Also within the scope of this invention are derivatives of 
PRTGF-a species other than variations in amino acid sequence. Such 
derivatives are characterized by covalent or aggregative association 
with chemical moieties. The derivatives generally fall into three 
classes: Salts, side chain and terminal residue covalent 
modifications, and adsorption complexes. 

After a PRTGF-a species has accumulated in direct 



2846Y 



- 9 - 



0154434 



recombinant prokaryotic culture (other than as a fusion with a 
prokaryotic signal sequence) it is separated from other proteins by 
virtue either of its physical form as water insoluble refractile 
bodies or its considerable stability against denaturation. 
Generally, the insoluble matter in the prokaryotic culture is _ 
recovered, refractile bodies ^(which contain the PRTGF-a species) 
separated from insoluble cell debris and the refractile bodies 
solubilized. Optionally, glutathione treatment will follow this 
step in order to enhance proper refolding of the protein. PRTGF-a 
species from recombinant eukaryotic cell culture is water soluble 
and does not require resolubilization; it is purified using 
conventional methods heretofore employed in isolating PRTGF-a 
species from natural sources. 

Purified PRTGF-a species from recombinant cell culture are 
combined for therapeutic use with physiologically innocuous stabi- 
lizers and excipients and prepared in dosage form as by lyophiliza- 
tion in dosage vials or, preferably storage in aqueous preparations. 
The latter is feasible because PRTGF-a species are quite stable to 
thermal denaturation due to the prevalence of disulfide bonds in the 
molecule. Alternatively, PRTGF-a species are incorporated into a 
polymer matrix for implantation into or attachment onto surgical 
sites, e.g. in bandages, thereby effecting a timed-release of the 
PRTGF-a species in a localized high gradient concentration. 

PRTGF-a species-containing compositions are administered to 
animals, particularly patients requiring accelerated tissue growth, 
in therapeutically effective doses. Suitable dosages will be 
apparent to the artisan in the therapeutic context. 

Brief Description of the Drawings 

Figure 1- shows oligonucleotides used as hybridization 
probes for the detection of the DNA sequence for human precursor 

TGF-a. 
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Figure 2 shows a nucleotide sequence of the 180 bp Sau3A- 
fragment of plasmid pTGF 15-1, containing the exon coding for the 
first 33 amino acids of human TGF-a. The deduced amino acid 
sequence is shown. The sequence in capital letters is part of the 
TGF-a polypeptide, while the smaller letter type shows the amino 
acid sequence preceding TGF-a in the precursor. The arrows indicate 
the acceptor and donor site of intervening sequences as determined 
by comparison with the cDNA sequence. Some relevant restriction 
sites are indicated. 

Figure 3 shows the nucleotide sequence and deduced amino 
acid sequence of the cDNA contained in plasmid pTGF-Cl . The G-C 
tails flank the cDNA at both sides. Numbers above each line refer 
to the amino acid position, assuming that the single methionine 
constitutes the NH 2 -terminus. The amino acid sequence for 
TGF-a is boxed and bounded at both sides with an 
Ala-Val rich sequence (overlined residues). Some 
relevant restriction sites are indicated. 

Figure 4 shows a schematic representation of the 
construction pathway for plasmids pTEl, 2, 3, 4, 5, 6, 7, 8. 

Figure 5 is a schematic representation of the TGF-a 
expression plasmids pTE5 and pTE6 indicating the amino acid junction 
of the TGF-a fusion proteins. 

Figure 6 shows the results upon electrophoresis in a SDS-13 
percent polyacryl amide gel (79) of total lysates of E. coli 
containing the expression plasmids pTE2, pTE3, pTE5, or pTE6. The 
arrows indicate the TGF-a fusion proteins. The values at the right 
show the positions of the protein markers. 

Figure 7 shows an SDS-polyacryl amide gel of the bacterial _ 
short TGF-a fusion protein, enriched by the acid-ethanol method. 
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The 68 residue TGF- a fusion protein migrates as a broad band (arrow) 
in this gel. The values at the right show the positions of the 
reference protein markers. 

Figure 8A shows the competition of the bacterial TGF-a 
short fusion protein with l25 l -label led EGF in a radioreceptor 
assay (solid line), performed and graphically represented as 
described (5, 14, 27). The calibration curve with EGF is shown as a 
dashed line. Figure 8B shows the EGF calibration curve. The 
ordinate shows the binding of 125 I -labelled EGF to the cells. 

Figure 9 shows soft-agar colony-forming activity of murine 
EGF, the bacterial TGF-a fusion protein, before and after cleavage 
with cyanogen bromide. The assay was performed in the presence of 
human TGF-p using MRK cells, clone 49F, as described (14). The 
ordinate scores the number of colonies larger than 850 urn 2 , while 
the abscissa indicates the concentration of EGF or bacterial TGF-a, 
expressed in EGF equivalents (ng/ml), as determined in the EGF, 
while the solid lines give the results for the bacterial TGF-a 
fusion protein before (open circle) or after {closed circle) 
treatments with cyanogen bromide. 

Figure 10 depicts plasmid pyTE2 for yeast expression of 
TGF-a. This plasmid has the TGF-a sequence (dashed) with the 
25 preceding a-factor (MF-a) prepro sequence under the transcriptional 

control of the a-factor promoter. The TGF-a sequence is followed by 
the "Able" gene 3' untranslated region and polyadenylation signal 
(open box). The TRP1 functions as the selection marker in yeast. 
The replication in yeast is assured by the presence of the 2n 
30 replication origin. AP R : ampicillin resistance. The junction 

between the a-f actor prepro sequence and the TGF-a sequence is shown 
at the right of the plasmid map. 



5 



10 



35 



2846Y 



- 12 - 



Detailed Description 

PRTGF-a species are defined for the purposes herein as 
polypeptides other than EGF which have a substantial region showing 
functional amino acid homology with the TGF-a precursor amino acid 
sequence set forth in Fig. 3 % or a fragment thereof. A candidate 
polypeptide is substantially homologous with PRTGF-a species as 
defined herein if greater than about 35 percent of the residues in 
the candidate correspond to a sequence within the PRTGF-a species 
sequence, without making conservative amino acid substitutions or 
introducing gaps. Ordinarily a candidate polypeptide, in addition 
to such functional homology, will be capable of exhibiting 
biological activity in common with its homologous PRTGF-a species. 

The degree of amino acid sequence homology which brings a 
polypeptide within the scope of the definition of PRTGF-a species 
will vary depending upon whether the homology between the candidate 
protein and PRTGF-a species falls within or without the PRTGF-a 
species regions responsible for biological activity; domains which 
are critical (a} for inducing morphological changes in target cells, 
(b) for immunological cross-reactivity with antisera raised against 
PRTGF-a species as may occur in non-recombinant sources, or (c) for 
cell surface receptor binding should exhibit a high degree of 
homology in order to fall within the definition, while sequences not 
involved in maintaining these functions show comparatively low 
homology. In addition, critical domains may exhibit one or more of 
these functions and yet remain homologous as defined herein if 
residues containing functionally similar amino acid side chains are 
substituted. Functionally similar refers to dominant characteristics 
of the side chains such as basic, neutral, hydrophobic or acid, or 
the presence or absence of steric bulk. 

Generally a polypeptide defined as PRTGF-a species will 

contain regions substantially homologous with the Fig. 3 protein or 
fragments thereof over a continuous domain of at least about from 10 
to 25 amino acid residues. 
2846Y ; 
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A significant factor in establishing the identity of a- 
polypeptide as a PRTGF-a species is the ability of antisera which 
are capable of substantially binding to the non recombinant 
counterpart to also bind to the activity of the polypeptide in 

5 question. However it will be recognized that immunological identity 
and identity as to other biological activity is not necessarily 
coextensive. For example, a neutralizing antibody for the receptor 
binding activity of the mature TGF- a of Fig. 3 may not bind a 
candidate protein because the neutralizing antibody happens to not 

10 be directed to specifically bind a site on mature TGF-a that is 
critical to its activity. Instead, the antibody may bind an 
innocuous region and exert its neutralizing effect by steric 
hinderance. Therefore a candidate protein mutated in this innocuous 
region might no longer bind the neutralizing antibody, but it would 

15 nonetheless fall within the definition of TGF-a in terms of its 
substantial homology with TGF-a. 

The language "capable" of biological activity means 
polypeptides which can be converted, as by enzymatic hydrolysis, 

20 from an inactive state anologous to a zymogen to a polypeptide 

fragment which exhibits the desired biological activity. Typically, 
inactive precursors will be fusion proteins in which PRTGF-a species 
is linked by a peptide bond at either terminus to a heterologous 
protein or fragment thereof. The sequence at this peptide bond is 

25 selected so as to be susceptible upon proteolytic tydrolysis to 
release PRTGF-a species, either in vivo or as part of a 
manufacturing protocol, in vitro . 

While PRTGF-a species ordinarily means human PRTGF-a 
3 q species, PRTGF-a species from sources such as murine, porcine, 
equine or bovine are included within the definition of PRTGF-a 
species so long as they otherwise meet the standards described above 
for homologous regions- Mature TGF-a in all cases, however, is 
"human" TGF-a* 

35 
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Derivatives of PRTGF-a species factor are included within 
the scope of the term. Derivatives include amino acid sequence 
mutants, glycosylated variants and covalent or aggregative 
conjugates with other chemical moieties. Covalent derivatives are 

5 prepared by linkage of functionalities to groups which are found in 
the PRTGF-a species amino acid side chains or at the N- or 
C-tennini, by means known in the art. These derivatives may, for 
example, include: Aliphatic esters or amides of the carboxyl 
terminus or residues containing carboxyl side chains, e.g., asp32 or 

10 49; 0-acyl derivatives of hydroxy! group-containing residues such as 
ser31", ser3, ser42, serl56 or ser94; and N-acyl derivatives of the 
amino terminal amino acid or amino-group containing residues, e.g. 
lysine or arginine. The acyl group is selected from the group of 
alkyl -moieties (including C3 to CIO normal alkyl), thereby forming 

15 alkanoyl species, and carbocyclic or heterocyclic compounds which 
forming aroyl species. The reactive groups preferably are 
difunctional compounds known per se for use in cross-linking 
proteins to insoluble matrices through reactive side groups. 

20 Covalent or aggregative derivatives are useful as reagents 

in immunoassay or for affinity purification procedures. For 
example, PRTGF-a species are insolubilized by covalent bonding to 
cyanogen bromide-activated Sepharose by methods known per se or 
adsorbed to polyolefin surfaces (with or without glutaraldehyde 

25 cross-linking) for use in the assay or purification of anti-PRTGF-a 
species antibodies or cell surface receptors. PRTGF-a species also 
are labelled with a detectable group, e.g., radioiodinated by the 
chloramine T procedure, covalently bound to rare earth chelates or 
conjugated to another fluorescent moiety for use in diagnostic 

30 assays, especially for diagnosis of PRTGF-a species levels in 
biological samples by competitive-type immunoassays. 

TGF-aN and TGF-aC are useful as immunogens (alone or as an 
immunogenic conjugate wittra heterologous protein) for raising 
35 antibodies against the portions of the TGF-a precursor. In a 
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two-site sandwich specific receptor binding assay, the purpose of 
which is to distinguish the TGF-a precursor from mature TGF-a, 
TGF-aN and TGF- a C, anti -TGF-aN antibody is immobilized prior to or 
during the course of the assay procedure, a test sample is incubated 

5 with the antibody in order to permit the precursor to bind thereto, 
and then anti-TGF-aC is incubated with the bound precursor. The 
anti-TGF-aC is labelled before incubation, for example by 
radioodination, or afterwards, for example by incubation with 
labelled IgG directed against the I gG of the species in which the 

10 anti-TGF-aC was raised. 

Antisera are raised against the predetermined PRTGF-a 
species fragments by cross! inking them to immunogenic proteins such 
as keyhole limpet hemocyanin (KLH) or serum albumin by the use of 

15 covalent agents such as gl utaraldehyde or succinate anhydride 

immunizing suitable animals such as mice or rabbits by subcutaneous 
injection with conventional adjuvants, boosting as necessary and 
recovering antisera. Monoclonal antibodies are prepared from spleen 
cells of immunized mice in conventional fashion, e.g. 

20 immortalization with EB virus or by cell fusion. 

In a method for the determination of TGF-aC or TGF-aN 
without interference from mature TGF-a, antisera are raised against 
predetermined fragments at opposite ends of the TGF-aC or TGF-aN 

25 molecules and the resulting two antisera are employed in a sandwich 
assay as described above for the assay of the TGF-a precursor. In 
the case of TGF-aC, for example, the sequence (Cys) a Arg His Glu 
Lys Pro Ser Ala Leu Leu Lys Gly Arg Thr Ala (Cys)^, wherein a or 
b, but not both, is 1, is conjugated to KLH by disulfide bonds and 

30 rabbits immunized with the conjugate. The antisera are harvested 
and stored. Similarly, rabbits are immunized against the TGF-aC 
sequence His Cys Glu Trp Cys Arg Ala Leu lie Cys Arg linked to KLH 
by the use . of succinic a nhydr idfLJit pH6. These two antisera, are ; „, 
useful in competitive or sandwich immunoassays. Alternatively, 

35 rabbits may be immunized against the entire TGF-aN or TGF-aC 

2846Y 
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polypeptides and two antisera for assay of each polypeptide selected 
based on their ability to not competitively inhibit one another for 
binding to TGF-aN or TGF- a C as the case may be. Competitively 
inhibiting antisera are still useful in competitive- type assays for 
proteins encompassing the fragment against which antisera were 
raised, but one will be unable to readily distinguish such other 
proteins from the fragment. TGF-omC most conveniently is assayed in 
body fluid samples such as urine by a sequential or simultaneous 
sandwich immunoassay in which one of the TGF-aC antibodies is used 
in concert with an anti-mature TGF-a antibody. 

It will be understood that natural allelic variations in 
PRTGF-a species exist and occur from individual to individual. 
These variations may be demonstrated by one or more amino acid 
deletions, substitutions or insertions. Other mutants are 
predetermined variations in the PRTGF-a species sequence made by 
site directed mutagenesis of the PRTGF-a species DNA. 

The objective of site directed mutagenesis is to construct 
DNA that encodes PRTGF-a species as defined, but which species also 
exhibit improved properties and activity. Mutant PRTGF-a species 
are defined as a polypeptide otherwise falling within the homology 
definition for PRTGF-a species set forth herein but which have an 
amino acid sequence different from that of PRTGF-a species whether 
by way of deletion, substitution or insertion. For example, the 
lysine residue at position 96 or 97 may be mutated to histidine or 
another amino acid residue which no longer permits the protein to be 
proteolytically cleaved at this site. Similarly, cysteine 47, 55, 
60, 71, 73 and/or 82 could be replaced by serine, or the carboxyl 
and/or amino terniinus of PRTGF-a deleted. It is not necessary that 
mutants have all of the biological characteristics of PRTGF-a 
species where the mutants retain at least one epitopic site which is 
cross -reactive with antibody to PRTGF-a species. 
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While the mutation site is predetermined in the PRTGF-a 
species mutations of this invention it is not necessary that the 
mutation per se be predetermined. For example, in order to optimize 
the performance of the position 47, 55, 60, 71, 73 or 82 mutants 

5 random mutagenesis may be conducted at the cysteine codons and the 

expressed PRTGF-a species mutants screened for the optimal 
combination of biological activity and compatibility with 
intracellular conditions in prokaryotes, i.e-, solubility upon 
direct expression. Techniques for making substitution mutations at 

10 predetermined sites in DNA having a known sequence are well known, 
for example M13 primer mutagenesis, but here the small size of 
PRTGF-a species facilitates chemical synthesis of the desired DNA 
having predetermined mutations. 

15 Mutagenesis is conducted by making amino acid insertions, 

usually on the order of about from 1 to 10 amino acid residues, or 
deletions of about from 1 to 30 residues. Substitutions, deletions, 
insertions or any subcombination may be combined to arrive at a 
final construct. Insertions include amino or carboxyl -terminal 

20 fusions, e.g. a hydrophobic extension added to the C- or N- terminus 
of mature TGF-a. Obviously, the mutations in the encoding DNA must 
not place the sequence out of reading frame and preferably will not 
create complementary regions that could produce secondary mRNA 
structure. 

25 

Not all mutations in the DNA which encode PRTGF-a species 
will be part of the final product. For example, a major class of 
DNA insertional mutations are those in which a heterologous 
secretory leader, or signal, has been appended to the N-terminus of 

30 the PRTGF-a species* Alternatively, N-terminal PRTGF-a species 
fusions with non-secretory heterologous polypeptides are 
contemplated by this invention where such polypeptides can be 
cleaved, far example by cyan-ogen bromide or enzymes, -from the - 
PRTGF-a species in order to yield unmethionylated PRTGF-a species. 

35 For example, in constructing a procaryotic expression vector the 
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£• coli alkaline phosphatase or heat stable enterotoxin II leaders 
are placed 5 1 from the PRTGF-a species sequence in reading frame 
therewith. Yeast invertase, alpha factor or acid phosphatase 
leaders are similarly used in yeast expression of unmethionylated 

5 PRTGF-a species- However, the native TGF-a precursor secretoiy 
leader may be recognized by hosts other than cells of its origin, 
most likely in cell culture of higher eukaryotic cells. When the 
secretory leader is "recognized" by the host, the fusion protein 
consisting of PRTGF-a species and the leader ordinarily is cleaved 

10 at the peptide bond joining PRTGF-a species and the signal in the 
events that lead to secretion of the PRTGF-a species. Thus, even 
though a mutant PRTGF-a species DNA is used to transform the host, 
and mutant prePRTGF-a species (a fusion) is synthesized as an 
intermediate, the resulting PRTGF-a species is not a fusion. 

15 

As used herein, transforming growth factor-B (TGF-e) 
denotes transforming growth factor of the 8 -type with the phenotype 
of naturally occurring TGF-b, e.g. capable of potentiating TGF-a or 
EGF (epidermal growth factor) for anchorage independent growth. 

20 

DNA encoding PRTGF-a species is covalently labelled with a 
detectable substance such as a fluorescent group, a radioactive atom 
or a chemi luminescent group by methods known per se . It is then 
used in conventional hybridization assays. Such assays are employed 
25 in identifying PRTGF-a species vectors and transfonnants as 

described in the Examples infra , or for in vitro diagnosis such as 
detecton of PRTGF-a species mRNA in tumor cells. 

The mRNA for PRTGF-a species, surprisingly, is relatively 
30 rare. This makes the cDNA easy to overlook were one not apprised as 

to what to look for. However, once its presence is appreciated and 
completely complementary DNA made available, as is enabled by the 
disclosures herein, it is routine to screen cDNA libraries of tumor 
cells for PRTGF-a species cDNA using probes having complementary 
35 sequences. 
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PRTGF-a species are synthesized by host cells, transformed 
with expression vectors containing DNA which encodes PRTGF-a 
species. Expression vectors include vectors which together with a 
host cell are capable of expressing DNA sequences contained therein, 
where such sequences are operably linked to other sequences capable 
of effecting their expression. These vectors must be replicable in 
the host organisms either as episomes or as an integral part of the 
chromosomal DNA. In general, expression vectors will be plasmids, 
circular doubled stranded DNA loops which, in their vector form, are 
not bound to the chromosome. In the present specification, 
"plasmid" and "vector" generally are used interchangeably as 
plasmids are the most commonly used form of vector. However, the 
invention is intended to include such other forms of expression 
vectors, e.g. cotransformation vectors and viruses which serve 
equivalent functions. 

DNA regions are operably linked when they are functionally 
related to each other. For example, DNA for a presequence or 
secretory leader is operably linked to DNA for a polypeptide if it 
is expressed as a preprotein which participates in the secretion of 
the polypeptide; a promoter is operably linked to a coding sequence 
if it controls the transcription of the sequence; or a ribosome 
binding site is operably linked to a coding sequence if it is 
positioned so as to permit translation. Generally, operably linked 
means contiguous and, in the case of secretory leaders, contiguous 
and in reading phase. 

Recombinant host cells are cells which have been 
transformed with the above-described vectors. As defined herein, 
PRTGF-a species are produced in the amounts achieved by virtue of 
this transformation, rather than in such lesser amounts, or, more 
commonly, in such less than detectable amounts, as might be produced 
by the untransformed host. PRTGF-a species produced by such cells 
are referred to as recombinant PRTGF-a specieis. 
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The vectors and methods disclosed herein are suitable for 
use in host cells over a wide range of prokaryotic and eukaryotic 
organisms* 

In general, of course, prokaryotes are preferred for 
cloning of DNA sequences in constructing the vectors useful in the 
invention. For example, E. coli K12 strain 294 (ATCC No. 31446) is 
particularly useful. Other microbial strains which may be used 
include col i strains such as col i B, and E. coli X1776 {ATCC 
No. 31537). These examples are, of course, intended to be 
illustrative rather than limiting. 

Prokaryotes may also be used for expression, although our 
experience has shown that PRTGF-a species sequences generally are 
deposited in the prokaryotic cell cytoplasm as insoluble refractile 
bodies. These are readily recovered and resolubilized. The 
aforementioned strains, as well as E. col i W3110 (F-x-, 
prototrophic, ATCC 27325), bacilli such as Bacillus subtilus , and 
other enterobacteriaceae such as Salmonella typhimurium or Serratia 
marcescens , and various Pseudomonas species may be used. 

In general, plasmid vectors containing replicon and control 
sequences which are derived from species compatible with the host 
cell are used in connection with these hosts. The vector ordinarily 
carries a replication site, as well as marking sequences which are 
capable of providing phenotypic selection in transformed cells. For 
example, JE. coli is typically transformed using pBR322, a plasmid 
derived from an E_. coli species (55). pBR322 contains genes for 
ampicillin and tetracycline resistance and thus provides easy means 
for identifying transformed cells. The pBR322 plasmid, or other 
microbial plasmid must also contain, or be modified to contain, 
promoters which can be used by the microbial organism for expression 
of its own protein- Those promoters most commonly used in 
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recombinant DNA construction include the B-lactamase (penicillinase) 
and lactose promoter systems (53, 72, 92) and a tryptophan (trp) 
promoter system (67, 93). While these are the most commonly used, 
other microbial promoters have been discovered and utilized, and 
details concerning their nucleotide sequences have been published, 
enabling a skilled worker to ligate them functionally with plasmid 
vectors (80). 

In addition to prokaryotes, eukaryotic microbes, such as 
yeast cultures, may also be used. Yeast express and secrete mature 
TGF-a at lower levels than bacteria, but the polypeptide is soluble 
unlike the direct expression product from E. coli . Saccharomyces 
cerevisiae , or common baker's yeast is the most commonly used among 
eukaryotic microorganisms, although a number of other strains are 
commonly available. For expression in Saccharomyces , the plasmid 
YRp7, for example, (81, 82, 83) is commonly used. This plasmid 
already contains the trp! gene which provides a selection marker for 
a mutant strain of yeast lacking the ability to grow in tryptophan, 
for example ATCC No. 44076 or PEP4-1 (84). The presence of the trp! 
lesion as a characteristic of the yeast host cell genome then 
provides an effective environment for detecting transformation by 
growth in the absence of tryptophan. 

Suitable promoting sequences in yeast vectors include the 
promoters for 3-phosphoglycerate kinase (85) or other glycolytic 
enzymes (85, 87), such as enolase, glyceral dehyde-3-phosphoglycerate 
mutase, pyruvate kinase, triosephosphate isomerase, phosphogl ucose 
i some rase, glucokinase and a-f actor. In constructing suitable 
expression plasmids, the termination sequences associated with these 
genes are also ligated into the expression vector 3* of the sequence 
desired to be expressed to provide termination of the mRNA and 
polyadenylation. Other promoters, which have the additional 
advantage of transcription controlled by growth -conditions are the 
promoter regions for alcohol dehydrogenase 2, isocytochrome C, acid 
phosphatase, degradative enzymes associated with nitrogen 
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metabolism, and the aforementioned glyceraldehyde-3-phosphate 
dehydrogenase, and enzymes responsible for maltose and galactose 
utilization (87). Any plasmid vector containing yeast-compatible 
promoter, origin of replication and termination sequences is 
5 suitable. 

In addition to microorganisms, cultures of cells derived 
from multicellular organisms may also be used as hosts. In 
principle, any such cell culture is workable, whether from 

10 vertebrate or invertebrate culture. However, interest has been 

greatest in vertebrate cells, and propagation of vertebrate cells in 
culture (tissue culture) has become a routine procedure in recent 
years (75). Examples of such useful host cell lines are VERO and 
HeLa cells, Chinese hamster ovary (CHO) cell lines, and WI38, BHK, 

15 COS-7 and HDCK cell lines. Expression vectors for such cells 
ordinarily include (if necessary) an origin of replication, a 
promoter located in front of the PRTGF-a species sequence to be 
expressed, along with any necessary ribosome binding sites, RNA 
splice sites, polyadenyl ation site, and transcriptional terminator 

20 sequences. 

For use in mammalian cells, the control functions on the 
expression vectors are often provided by viral material. For 
example, commonly used promoters are derived from Polyoma, 

25 Adenovirus 2, and most frequently Simian Virus 40 (SY40). The early 
and late promoters of SV40 virus are particularly useful because 
both are obtained easily from the virus as a fragment which also 
contains the SY40 viral origin of replication (88) incorporated 
herein by reference. Smaller or larger SV40 fragments may also be 

30 used, provided there is included the approximately 250 bp sequence 

extending from the Hind III site toward the B£ll site located in the 
viral origin of replication. Further, it is also possible, and 
often desirable, to utilize promoter or control sequences normally 
associated with the desired gene sequence, provided such control 

35 sequences are compatible with the host cell systems. 
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An origin of replication may be provided either by 
construction of the vector to include an exogenous origin, such as 
may be derived from SV40 or other viral (e.g. Polyoma, Adeno, VSV, 
BPV, etc.) source, or may be provided by the host cell chromosomal 
5 replication mechanism. If the vector is integrated into the host 

cell chromosome, the latter is often sufficient. 

Rather than using vectors which contain viral origins of 
replication, one can tranform mammalian cells by the method of 

10 cotransfonnation with a selectable marker and the PRTGF-a species 
DNA. *An example of a suitable selectable marker is di hydrofolate 
reductase (DHFR). In selecting a preferred host mammalian cell for 
transfection by vectors which comprise DNA sequences encoding both a 
PRTGF-a species and DHFR, it is appropriate to select the host 

15 according to the type of DHFR protein employed. If wild type DHFR 
protein is employed, it is preferable to select a host cell which is 
deficient in DHFR thus permitting the use of the DHFR coding sequence 
as a marker for successful transfection in selective medium which 
lacks hypoxanthine, glycine, and thymidine. An appropriate host 

20 cell in this case is the Chinese hamster ovary (CHO) cell line 

deficient in DHFR activity, prepared and propagated as described by 
Urlaub and Chasin, 1980, M Proc. Natl. Acad. Sci."(USA) 77: 4216. 
(^transformation is further described in U.S. patent 4,399,216; the 
procedures therein are adapted for use in PRTGF-a species synthesis 

25 by substitution of DNA encoding a PRTGF-a species sequence for the 
genomic or B-globin DNA used in the cited patent using appropriate 
synthetic linkers as required. 

On the other hand, if DNA encoding DHFR protein with low 
30 binding affinity for methotrexate (MTX) is used as the controlling 
sequence, it is not necessary to use DHFR resistant cells. Because 
the mutant DHFR is resistant to MTX, MTX containing media can be 
used as a means of selection provided that the host cells are 
themselves MTX sensitive. Most eukaryotic cells which are capable 
35 of adsorbing MTX appear to be methotrexate sensitive. One such 

2846Y 



- 24 - 



9154434 



useful cell line is a CHO line, CH0-K1 (ATCC No. CCL 61). 

The method by which PRTGF-a species are recovered from cell 
culture will depend upon whether or not they are expressed as a 
5 soluble protein or a refractile body (insoluble aggregate). The 
latter will usually be the case with bacterial expression, where 
purification is readily accomplished because most cell proteins are 
soluble. 

10 Soluble PRTGF-a species are more readily recoverable from 

cell culture if expressed in a secretory host vector system, e.g. if 
linked to a bacterial or yeast secretory leader, or if a vertebrate 
cell line is transformed with the entire TGF-a precursor sequence 
including its normal signal sequence. 

15 

Soluble PRTGF-a species may be purified using 
alkyl-sepharose chromatography, gel sieving, gel electrophoresis, or 
receptor-binging affinity chromatography using immoblized, 
antibodies, or receptors. 

20 

Compositions containing PRTGF-a species are prepared for 
administration to patients by mixing PRTGF-a species having the 
desired degree of purity with physiologically acceptable carriers, 
i.e., carriers which are nontoxic to recipients at the dosages and 
25 cor :entrations employed. Ordinarily, this will entail combining the 
PRTGF-a species with buffers, antioxidants such as ascorbic acid, 
low molecular weight (less than about 10 residues) polypeptides, 
proteins, amino acids, carbohydrates including glucose or dextrins, 
chelating agents such as EDTA, and other stabilizers and excipients. 

30 

Also administered to animals are compositions containing 
PRTGF-a species immunogenic conjugates or PRTGF-a species conjugates 
and antibodies capable of binding to PRTGF-a species, the "former "to 
raise antibodies particularly for use in diagnostic kits, the latter 
35 to generate anti -anti body an ti sera. 
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The route of administration of such compositions is in 
accord with known methods, e.g. intravenous, intraperitoneal, 
intramuscular or intralesional infusion or injection of sterile 
therapeutic solutions, or by timed release systems as noted below. 

PRTGF-a species compositions may be administered from an 
implantable timed-release article. Examples of suitable systems 
include copolymers of L-glutamic acid and gamma ethyl -L-glutamate 
(U. Sidman et aU, 1983, "biopolymers" 22 (1): 547-556), poly 
(2-hydroxethyl-methacrylate) (R. Langer et alk , 1981, "J. Biomed. 
Mater.- Res." 15: 167-277 and R. Langer, 1982, "Chem. Tech." 12: 
98-105) or ethylene vinyl acetate (R. Langer et aK, Id.}. The 
article is implanted at surgical sites or over wounds. 
Alternatively, the compositions may be encapsulated in semipermeable 
microcapsules or liposomes for injection. 

The amount of said compositions that is administered will 
depend, for example, upon the route of administration, the target 
disease and the condition of the recipient. Intralesional 
injections will require less composition on a body weight basis than 
will intravenous infusion. Accordingly, it will be necessary for 
the therapist to titer the dosage and modify the route of 
administration as required to obtain optimal activity as can be 
determined for example by biopsy of the target tissue or diagnostic 
assays. 

TGF-a is a potent bone resorption agent. Accordingly, it 
is therapeutically useful for this purpose. It is adminstered by 
injection, infusion or timed release and the dose is titered by 
following the plasma calcium ion levels; to induce bone resorption 
the dose is titered to generate hypercalcemia. In some patients 
just the converse is the proper therapeutic approach. These 
patients are those who suffer from carcinoma and, as the case with 
some tumors, hypercalcemia attendant the synthesis of PRTGF-a 
species by the tumor cells. These patients are identified by the 
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presence of above-normal concentrations of TGF-omC or TGF-a in 
urine, serum or surgically excised tumor tissue if available- The 
therapeutic regimen involves administering to such patients a TGF-a 
neutralizing agent such as anti-TGF-a (or its antigen-specific Fab 

5 region) or a TGF-ct receptor such as the EGF receptor (or its 

TGF-a-binding amino terminal extracellular domain). Methods are 
described above for producing antisera to selected TGF-a domains. 
The antibodies then are screened for their ability upon injection, 
infusion or timed release to correct the hypercalcemia of such 

10 patients. Alternatively, the antibody is generated in situ by 

immunizing the patient against TGF-a or a selected domain thereof as 
described above. 

The nucleotide and amino acid sequence for the EGF receptor 

15 are known (A. Ullrich et a]_. f May 1984, "Mature" 309: 418-425). In 
addition EGF receptor is obtainable from A431 epidermal carcinoma 
cells, a publicly available cell line having about 10-50 times more 
EGF receptor on their surface than most other cell types. A431 
cells also secrete a truncated receptor having on the extracellular 

20 EGF binding domain (A. Ifll rich et a]_. , I_d.). Either EGF receptor 
species is purified according to known methods (M. Waterfield et 
al 1982 "J. Cell Biochem." 210 : 149-161) or equivalent techniques 
known to those skilled in the art such as affinity chromatography on 
EGF linked to cyanogen bromide activated sepharose. They are 

25 formulated in phannaceutically acceptable carriers such as sterile 
saline in concentrations therapeutically effective upon infusion to 
bind free TG r -a and thereby lower the serum calcium level. 
Therapeutic efficacy is monitored by the reduction in hypercalcemia 
or by assay of free versus bound TGF-a in fashion analogous to 

30 immunoassays presently employed for the assay of free thyroxine in 

plasma. The use of the EGF receptor or the TGF-a receptor {or their 
TGr-a-bincir: regions) is preferred over antibodies because of ease 
of manufacture. However, antibodies have the advantage that they 
can be selected having affinities greater than those of naturally 

35 occuring receptors. 
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Methods Employed 

If cells without formidable cell wall barriers are used as 
host cells, transfection is carried out by the calcium phosphate 
5 precipitation method as described in (89). However, other methods 
for introducing DNA into cells such as by nuclear injection or by 
protoplast fusion may also be used. 

If prokaryotic cells or cells which contain substantial 
10 cell wall constructions are used, the preferred method of 

transfection is calcium treatment using calcium chloride as 
described by (90). 

Construction of suitable vectors containing the desired 
1 5 coding and control sequences employ standard ligation techniques. 
Isolated plasmids or DNA fragments are cleaved, tailored, and 
reli gated in the form desired to form the plasmids required. 

Cleavage is performed by treating with restriction enzyme 
20 (or enzymes) in suitable buffer. In general, about 1 ug plasmid or 
DNA fragments are used with about 1 unit of engine in about 20 nl of 
buffer solution. (Appropriate buffers and substrate amounts for 
particular restriction enzymes are specified by the manufacturer.) 
Incubation times of about 1 hour at 37°C are workable. After 
25 incubations, protein is removed by extraction with phenol and 
chloroform, and the nucleic acid is recovered from the aqueous 
fraction by precipitation with ethanol. 

If blunt ends are required, the preparation can be treated 
30 for 15 minutes at 15°C with 10 units of Polymerase I (Klenow), 
phenolchloroform extracted, and ethanol precipitated. 

Size separation of the cleaved fragments is often performed 
using 6 percent polyacryl amide gel, e.g. as described by Goeddel , 
35 et al. (67). 
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For ligation, the desired components, suitably end tailored 
to provide correct matching, are treated with about 10 units T4 DNA 
ligase per 0.5 u9 DNA. (When cleaved vectors are used as 
components, it may be useful to prevent religation of the cleaved 
vector by pretreatment with bacterial alkaline phosphatase.) 

For analysis to confirm correct sequences in plasmids 
constructed, the ligation mixtures are used to transform £. coli K12 
strain 294 (ATCC 31446), and successful transformants selected by 
ampicillin or tetracyclin resistance where appropriate. Plasmids 
from the transformants are prepared, analyzed by restriction enzymes 
and/or sequenced (45, 91). 

General Description of Preferred Embodiments 

Examples 

The following examples are intended to illustrate but not 
to limit the invention. In the examples here an £. coli host 
culture was employed as host cell culture. However, other 
eukaryotic and prokaryotic cells are suitable for the method of the 
invention as well . 

1. Isolation of a TGF-g specific genomic DMA clone . 

Isolation of a TGF-a gene is based upon specific 
hybridizatin with synthetic oligonucleotides. These probes were 
designed on the basis of a preliminary partial amino acid sequence 
for human TGF- a and are shown in Fig. 1. 

It has recently been shown that specific DNA sequences can 
be isolated from cDNA or genomic DNA libraries by hybridization 
under low stringency conditions with long synthetic oligonucleotides 
which contain many mismatches. This approach was successful in the 
detection of the gene for human insulin-like growth factor I as set 
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forth in U.S. Appln. Serial No. 501,351 filed June 6, 1983 (EPO 
Application No. 84.3037847), incorporated herein by reference. A 
similar approach was attempted for the isolation of the DNA sequence 
coding for human TGF-a. Two long oligonucleotides were synthesized 
for this purpose (Figure 1). A 41-mer corresponds to a sequence, 
coding for amino acids 12 to 25, while the 48-mer is complementary 
to a sequence coding for amino acids 1 to 16. Since several 
different codons are possible for each amino acid, nucleotide choice 
of the oligonucleotides was based on the codon bias observed in 
human mRNAs (94). Also, the presence of multiple CpG di nucleotides 
was avoided. Since the 3' ends of the 41-mer and the 48-mer are 
complementary over a length of 15 residues, it is possible to 
hybridise both oligonucleotides and extend them with avian 
myeloblastosis virus (AMV) reverse transcriptase, thereby creating a 
double-stranded 74-mer. In addition to these long oligonucleotides, 
two sets of 14-mers were synthesized. Four pools of 14-mers, 
designated 1A, B, C and D are complementary to all possible codons 
for amino acids 5 to 9. Similarly, four other pools of 14-mers, 
named 2A through D correspond to amino acids 15 to 19. 

These oligonucleotides shown in Figure 1 were used as 
hybridization probes for the detection of the DNA sequence for human 
TGF-a. The oligonucleotides were designed with reference to a 
partial amino acid sequence for human TGF-a (35) and were 
synthesized using the solid phase phosphotriester method (38, 39). 

The 41-mer, 48-mer and the 14-mers were 5'-labeled with 
Y _ 3 2p_ AT p and polynucleotide kinase in a reaction mixture 
containing 70 mM Tris-HCl (pH 7.6),, 10 mM MgCl 2 , 5 mM 
dithiothreitol at 37°C for 30 min. The 74-mer was prepared by 
heating equimolar amounts of the 41-mer and the 48-mer for 5 min. at 
70°C in a 25 ul mixture of 105 mM Tris-HCl, pH 8.3, 140 mM KC1 , 50 
mM MgCl^ an d 210 mM B-mercaptoethanol and gradually cooling over a 
30 min interval to room temperature. These annealed oligonucleotides 
were subsequently extended to double-stranded radioacti vely-labeled 
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74-mers by adding dTTP and dGTP to 10 mM, 150 nCi each of 
a- 32 P-dATP and a- 32 P~dCTP and 80 units of AMY reverse 
transcriptase to a total volume of 70 nl. The incubation was for 30 
min. at 37 °C. Unlabeled dATP and dCTP were then added to 10 mM and 
5 the reaction was allowed to proceed for 15 min. The unincorporated 
nucleotide triphosphates were separated from. the hybridization 
probes by chromatography over a Sephadex G50 Superfine. 

Initially focus was on the isolation of TGF-a specific 

10 clones from cDNA libraries derived from mRNA from the human melanoma 
cell line A2058 (5, 34), which was used as a source for the purifi- 
cation of TGF-a. Extensive screenings with the 14-mers, the 41-mer 
and the 74-mer permitted the isolation of some hybridizing cDNA 
clones, which upon sequence analysis were found to be unrelated to 

15 TGF-a. Because of this lack of success, it was decided to search 
for the TGF-a gene in a human genomic DMA library contained in a x 
Charon 4A phage (40). About 7.5xl0 5 phages were screened by 
hybridization with the 41-mer, which was 5*-labeled with 32 P. In 
another experiment, an equal number of recombinant phages were 

20 screened by replica plating of the recombinant x phages onto nitro- 
cellulose filters (95) and hybridization with the radioacti vely- 
labeled 74-mer. These screenings resulted in the detection and 
isolation of 35 individual recombinant x phages which hybridized 
with the 41-mer and/or the 74-mer. DNA was isolated from all 35 

25 phages. Hybridization with the 41-mer 5 74-mer, 48-mer and the 

14-mer pools 1A-D and 2A-D was assessed by M dot blot" analysis (43) 
for each recombinant x DMA. None of these isolated phage DMAs 
hybridized clearly with the 48-raer or the pools 1A-D, while about 
half of the DNAs showed some hybridization with a mixture of the 

30 14-mer pools 2A-D. 

The 41-mers, 48-mers and 74-mers were hybridized in 5X SSC 
(IX SSC = 0.15 H NaCl, 0.015 M sodium citrate), 5X Denhardt solution 
(IX Denhardt solution =0.1 percent Ficoll , 0.1 percent 
35 polyvinpyroll idone, 0.1 percent bovine serum albumin, 20 percent 
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fonnamide and 50 ug/ml sonicated salmon sperm DNA. 

After the filters were prehybridized for 2 hrs at 42°C, the 
heat denatured probe was added and hybridization took place at 42°C 
for 15-20 hrs. The filters were subsequently washed extensively in 
IX SSC 9 0.1 percent SDS at 37°C. When the 14-mers were used as 
probes, the prehybridization 'for 2 hrs and the hybridization for 15 
hrs were at 37°C in 6X SSC, 0.5 percent NP40, 6 mM EDTA, IX Denhardt 
solution and 50 ug/ml salmon sperm DNA. Several washes were then 
performed in 6X SSC at room temperature. 

The extent of hybridization with the 41-mer and the 14-mers 
2A-D was evaluated by washing the hybridized "dot-blot 11 nitro- 
cellulose filters under increasingly higher stringency. This was 
done in order to restrict further the number of phages to be 
considered as potential candidates for further analysis. Twelve 
phages were selected for sequence determination on the basis of this 
evaluation. These phage DMAs were digested with BamH I , Hind HI or 
the combination of both enzymes and the fragments were separated on 
agarose gel . 

Southern analysis (44) of the phage DNAs, which hybridized 
with the 41-mer or the 14-mer pools 2A-D showed that the sequences 
hybridizing to either of the probes were localized within a same DNA 
segment. The hybridizing BamH I or Hind HI fragment of each phage 
DNA was subsequently subcloned into plasmid pBR322. This chimeric 
plasmids were in turn cleaved with the endonucleases Sau3AI , Rsal , 
or both. The mixture of fragments was separated on polyacryl amide 
and agarose gels and transferred onto nitrocellulose filters. 
Hybridization with the 41-mer permitted the identification of a 
hybridizing small fragment for all 12 plasmids. These fragments 
were subsequently subcloned into M13 mp8 or mp9 (45) and their 
nucleotide sequence was determined by the dideoxy nucleotide chain 
termination method (46). 
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One of the plasmids, designated pTGF15-l, revealed the 
sequence coding for the first 33 amino acids of TGF-a, located 
within a 180 base pair Sau3 AI fragment (Figure 2). pTGF15-l 
contains a 10,2 kilobasepair BarcH I fragment, derived from the 

D recombinant phage xl5. The codon for the 33rd amino acid is 

followed by a stop codon. Th„e GT-di nucleotide at that posiion marks 
the donor site of an intervening sequence (47). Restriction mapping 
of pTGF15-l and further sequence analysis (data not shown) showed 
that the 5au3 AI fragment is located on a 670 bp Sacl-Ball fragment 

10 and that the Sau3AI site downstream of the splice donor site is also 
the recognition site for Bgll l enzyme. 

The nucleotide sequence of the 180 bp Sau3A-f ragment of 
plasmid pTGF15-l, containing the exon coding for the first 33 amino 

15 acids of human TGF-a and the deduced amino acid sequence are shown 
in Figure 2. The sequence in capital letters is part of the TGF-a 
polypeptide, while the small letter type shows the amino acid 
sequence preceding TGF-a in the precursor. The arrows indicate the 
acceptor and donor site of the intervening sequences as determined 

20 by comparison with the cDNA sequence [see below). 

Close examination of this nucleotide sequence shows that 33 
residues of the hybridizing 41-mer are homologous with the obtained 
TGF-a DNA sequence. Fourteen homologous bases are in a continuous 

25 stretch. It is not clear why the 48-mer did not hybridize 

significantly since 37 of the 48 residues are homologous. The 
perfect homology with one of the 14-mers of pool 2D results in a 
clear, although surprisingly weak, hybridization. The lack of 
hybridization with the 14-mers 1A-D is due to the presence of a 

30 codon for aspartic acid at position 7 in the mature TGF-a, instead 
of the lysine which was originally predicted. This difference 
results in the presence of two mismatching residues in these 14-mers. 

The isolated 180 bp Sau3AI fragment of pTGF15-l was 
35 subsequently used as a hybridization probe in a dot blot analysis 
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(43) of the previously isolated 35 recombinant phages. Five of the 
35 phage DNAs hybridized with this fragment. Southern analysis 
shows that they all contain the same hybridizing 670 bp Sacl-Ball 
fragment. 

2. A TGF-a mRNA of about 5000 nucleotides long 

The above described results show that the genomic sequence 
coding for human TGF-a is interrupted by an intervening sequence. 
In order to obtain the full size coding sequence, the genomic 
Sad -Ball fragment containing the TGF-a exon was used to probe cDNA 
libraries derived from mRNA from the melanoma line A2058. Since 
these efforts were again unsuccessful, it was decided to search for 
another cell line as a source of the TGF-a mRNA. 

A collection of different mRNAs, extracted from a wide 
variety of tumor cell lines, was examined by electrophoresis on 
formaldehyde agarose gels (41) and "Northern" hybridization (42) 
with the TGF-a specific SacI -Ball fragment (data not shown). All 
cell lines showed a weakly, probably non-specifically hybridizing 
band at the position of the 28S ribosomal RNA, still present in 
these oligo dT-cellulose selected mRNA preparations. One cell line 
1072 F57, derived from a renal cell carcinoma, showed a clearly 
stronger hybridization signal at the 28S position, indicating the 
presence of TGF-a mRNA of about 4800-5000 nucleotides long. The 
related polypeptide EGF is also encoded by an mRNA of about 4800 
nucleotides long (48, 49). The primary translation product of this 
mRNA is an EGF precursor polypeptide, which is subsequently 
processed into several peptides, one of which is EGF, 

3. Isolation of a cDNA coding for TGF-g 

. ..In-order to .isolate. a cDNA containing, the complete sequence 
coding for human TGF-a, RNA was isolated from the above cell line. 
The polyadenylated mRNA fraction was isolated by absorption to oligo 
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dT-cellulose chromatography (50). cDNA was prepared by conventional 
methods (51-53), tailed with dC-homopolyme rs (54) and annealed into 
the PstI -linearized and dG-tailed pBR322 (55)* Transformation (56) 
in _E. coli 294 (57) was performed using the high efficiency method 

5 of Hanahan (58) and gave rise to three cDNA libraries. One library 
contained cDMAs which were primed with dT 12 -18' wt " le specifically 
primed cDNA synthesis from the synthetic 16-mer dCATGCTGGCTTGTCCT 
was utilized to prepare the two other libraries. This oligonucleo- 
tide is complementary to the downstream region (nucleotides 134 to 

10 149) of the TGF-a exon, contained within the 180 bp long Sau3AI 

restriction fragment of pTGF15-l (Figure 2). The bacterial colonies 
were screened (59) using the radioactively labeled TGF-a specific 
Sad -Ball restriction fragment prepared from plasmid pTGF15-l. Only 
one in 90,000 recombinant JE. coli clones which were prepared by 

15 specifically primed cDNA synthesis, hybridized -with the probe. 
Restriction enzyme analysis of this plasmid, called pTGF-Cl , 
revealed the presence of three PstI fragments, the shortest of which 
is the 67 bp fragment also present in the TGF-a exon of pTGF15-l 
(Fig. 2). The three PstI fragments, which represent the cDNA insert 

20 of about 900 bp, were subsequently subcloned into M13mp8 (45) and 
sequenced with dideoxy chain termination method (45). The cDNA 
sequence of plasmid pTGF-Cl with its deduced amino acid sequence is 
shown in Fig. 3. 

25 Figure 3 shows the nucleotide sequence and deduced amino 

acid sequence of the cDNA contained in plasmid pTGF-Cl. The G-G 
tails flank the cDNA at both sides. The nucleotides are numbered 
beneath each line- Numbers above each line refer to the amino acid 
position, assuming that the single methionine constitutes the 

30 NHg-teraiinus. The amino acid sequence for TGF-a is boxed 
and bounded at: both sides with an Ala-Val rich sequence 
(overlinec residues) _ 



35 Examination of the nucleotide sequence shows that the cDNA 
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synthesis did not initiate at the RNA sequence specified by the 
primer (position 134-149 of the TGF-a exon, Fig. 2), but rather 
downstream of the position corresponding to the specific 
oglionucleotide. A gene fragment which is immediately downstream of 

5 the 3' end of this cDNA was subsequently isolated from a recombinant 
phage and did not reveal a sequence which resembles the specific 
primer. It can therefore be assumed that a random cDNA initiation 
event generated this TGF-a cDNA. It is possible that the secondary 
structure of the TGF-a mRNA precluded specific hybridization with 

10 the oligonucleotide and therefore the specific cDNA initiation. 
Alignment of the sequences of the cDNA (Fig. 3) and the genomic 
fragment from pTGF15-l (Fig. 2) indicates the presence of a splice 
acceptor and donor site (47) in the genomic DNA. The TGF-a exon 
contained with the 180 bp SauA I fragment is thus 121 bp long (Figure 

15 2). 

The recognizable amino acid sequence for TGF-a establishes 
the reading frame in the cDNA sequence. The coding sequence ends at 
nucleotide 527 with a TGA as stop codon and is followed by part of 

20 the 3' untranslated region. The open reading frame continues up to 
the 5' end of the cDNA. It is thus possible that only a part of the 
sequence, coding for the TGF-a precursor, is located on this cDNA, 
especially since the mRNA is about 4800-5000 nucleotides long. It 
is, however, significant that the ATG codon for the single 

25 methionine residue is preceded by an A at the -3 position and is 

immediately followed by a G residue. These features are 
characteristic of the initiation codons in most mRNAs of higher 
eukaryotes (60). In addition, the sequence between positions 8 and 
18 is very characteristic for a hydrophobic core (61) of a signal 

30 peptide involved in protein secretion from the cells. Comparison 

with other signal sequences (61-62) suggests that the cleavage by 
the signal peptidase could occur following the Ala at position 19, 
Cys at position 20 or the Ala at position 22, if this sequence 
indeed- rep resents the signal peptide. However, the assignment of 

35 this single methionine as the start of the TGF-a precursor implies 
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that the 3 1 untranslated sequence of the mRNA would probably have an 
unusually large length of about 4,000 nucleotides. 

The cDNA sequence, shown In Figure 3, reveals the complete 
5 DMA sequence for human TGF-a embedded in a larger coding sequence 
for the precursor protein. Direct amino acid analysis of the rat, 
mouse and human TGF-a (27, 35) has revealed the Val-Val sequence at 
the NH 2 - terminus. Based on the polypeptide length of 50 amino 
acids and on the sequenced carboxyl terminus of the rate and mouse 

10 TGF-a ends with the Leu-Ala residues at positions 88 and 89 (Fig, 

3). In order to generate the 50 amino acid long TGF-a, proteolytic 
cleavage must occur at both the amino- and carboxyl -termini between 
alanine and valine residues. This Ala-Val dimer at the 
NH 2 -terminus, which is located within the sequence 

15 Yal-Ala-Ala-Ala-Yal-Val, is very similar to the Ala-Yal-Val -Ala-Ala 
sequence found at the carboxyl end. A protease with this remarkable 
specificity and which could thus be responsible for the proteolytic 
processing of the TGF- a precursor has not yet been described. 

20 The complete sequence coding for TGF-a has now been 

determined in the cDNA derived from a renal cell carcinoma and in a 
gene fragment isolated from a genomic library which was derived from 
a normal fetal liver (40). Both sequences are identical, indicating 
that there are no coding differences between the TGF-a genes in both 

25 sources. 

Alternatively, having established the DNA coding sequence 
for TGF-a (See Figure 3), the gene can be synthesized using 
conventional methods, such as those described in references (38, 39). 

30 

The deduced amino acid sequence for the precursor of TGF-a 
reveals a very hydrophobic region beginning at 20 amino acids 
downstrea- of the carboxyl terminus of 50 amino acid TGF- a . The$e 
residues 103 to 121 consist almost exclusively of leucines, 
35 isoleucines and valines. The sequence from amino acid 118 to the 
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carboxyl terminus of the precursor polypeptide is remarkably rich in 
cysteines. This sequence of 42 amino acids contains 8 cysteines, 4 
of which are clustered in pairs- This cysteine-rich sequence could 
possibly constitute a biologically active polypeptide. It can as 

5 yet only be speculated how this peptide might be cleaved from the 

precursor molecule. Several polypeptide hormones are synthesized as 
larger precursors and are usually bounded by pairs of basic amino 
acids. The Lys-Lys residues at positions 96-97 could possibly be 
the site of proteolytic cleavage as in the case of preproenkephal in 

-10 (63-64), the calcitonin precursor (65) and the corticotropin-B- 
lipotropin precursor (66). 

Gel filtration analysis suggests the existence of some 
larger TGFs-a with estimated molecular weights of 10 to 23 

15 kilodaltons (1, 5, 22, 28, 29, 30 9 33, 34). The nature of these 

larger TGFs-a is unknown. It is possible that several related genes 
coding for TGFs-a are present in the genome. However, Southern 
hybridizations (44) of total human genomic DNA with the 180 bp 
Sau3AI and the 670 bp SacI -Ball fragment of pTGF15-l did not reveal 

20 the presence of multiple genes. Alternatively, the nature of some 
of these larger TGFs might be explained by different types of post 
translational processing of the precursor molecules. It is also 
possible that aggregation of the TGFs-a with some other proteins 
could result in an apparent larger molecular weight. 

25 

4. Expression of TGF-a in E. coli 

TGF-a is made in minute amounts by many tumor cells. 
Indeed, only 1.5 M g of the small TGF-a species has been isolated 
20 from 136 Uters of culture supernatant of the melanoma cell line 
A2058, which is considered an "overproducer" of TGF-a (34). This 
very low availability of the TGF-a from cell culture has hampered 
its biological characterization. In order to facilitate these 
studies the synthesis of human TGF-a in E. coli was pursued. 

35 
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As the sequence coding for TGF-a is embedded in a 
precursor, we introduced a start codon in front and a stop codon 
behind the coding sequence. The start codin is preceded by an EcoRI 
recognition site and the stop codon is followed by a BgTI I site, so 
that the TGF-a sequence becomes available as a portable EcoR I -Bgll l 
restriction fragment (Fig, 4). 



TGF-a was expressed as part of different fusion proteins in 
E.. coli in a way similar to human somotostatin (72) , insulin (73) 

10 and desacetylthymosin-al (74). In these latter cases the mature 

protein can be cleaved from the fusion polypeptide using cyanogen 
bromide. This chemical treatment results in the specific cleavage 
behind the methionine residue (72), which was introduced to connect 
the front part of the fusion protein with the mature polypeptide. 

^ 5 Two plasmids were designed so that the TGF-a coding sequence and its 
preceding methionine codon are linked at the EcoR I site to the 
sequence for the front part of a trp leader - trp E fusion protein 
(trp aLE 1413, ref. 76) . The expression of this fusion protein is 
under the control of the trp promoter using the trp leader ribosome 

20 binding sequence. In the case of the expression plasmid pTE6, the 

sequence for the first 190 amino acids of this trp aLE 1413 fusion 
protein, including several cysteine residues, is linked in frame 
with the TGF-a sequence and followed by a stop codon. In plasmid 
pTE5 the sequence coding for only the first 17 amino acids of this 

25 trp LE fusion protein precedes the TGF-a DNA sequence. This stretch 

of 17 amino acids does not contain any cysteines, so that the 
presence of this NH 2 -tenninus would probably not affect the 
disulfide bond formation of the expressed protein (Fig. 5a). In 
both cases cleavage with cyanogen bromide will release mature TGF-a 

30 because of the presence of a methionine codon in front of the TGF-a 

coding sequence. 



Figure 4 shows a schematic representation of how several 
plasmids for expression of TGF-a (amino acids 40-89, Fig. 3), with 
35 or without its downstream sequence (amino acids 90-160, Fig. 3) were 
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constructed. Restriction mapping indicated that in pTGF15-l, the 
180 bp Sau3AI -BglI I fragment which carries the TGF-a exon is 
contained within a 380 bp PvuII-Smal fragment- This latter fragment 
was isolated, denatured and renatured in the presence of the 
synthetic oligonucleotide dCATGGTGGTGTCCCATTTT, which was 5'-labeled 
using Y 32 P-ATP and T4 kinase. E. coli DNA polymerase I Klenow 
fragment was added to the mixture to catalyze the repair synthesis 
as described (67). Using this primer repair technique the CATG 
sequence was introduced in front of the coding sequence for TGF-a. 
The reaction products were then cut with Bgll l and the 130 bp 
fragment containing the partial TGF- a sequence was isolated by 
polyacryl amide gel electrophoresis. Plasmid pYG121, which is 
essentially identical to the expression plasmid pIFN-s2 (pBoIFN-e2) 
(77) except that the bovine IFN-b2 DNA sequence is replaced by a 
short synthetic DNA fragment, was opened at its unique EcoRI site, 
filled in with IE. coli DNA polymerase I (Klenow fragment), and cut 
with BglH . The 130 bp TGF-a fragment was ligated into this vector, 
thus restoring the EcoR I site. The resulting plasmid is called pTEl. 

The 130 bp EcoR I -BglH restriction fragment which contains 
the sequence coding for the first 33 amino acids was isolated from 
pTEl and ligated to the 350 bp Bglll -BamHI fragment of pFIFtrp 3 69 
(67), which contains the front part of the tetracycline resistance 
gene, and to the large vector fragment of the EcoR I and BamH I cut 
pINCV-PA13-33 (i.e., pINCV (78), containing a plasminogen activator 
cDNA insert). The resulting plasmid pTE2 contains the sequence for 
the first 33 amino acids of TGF-a linked via an EcoR I site to the 
sequence coding for the NH 2 -terminal 17 amino acids of the 
aforementioned trp ALE 1413 (ref. T 76) fusion protein. 

The 130 bp RI -Bgll l fragment containing the TGF-a sequence 
was also linked to the sequence coding for the first 190 amino acids 
of the aforementioned trp LE fusion polypeptide. This plasmid, 
pTE3, was constructed by ligation of the 130 bp RI -Bgll l fragment of 
pTEl to the Bgll l-Bam fragment of pFIFtrp369 and the large EcoR I 
and BamH I vector fragment of pNCV (51). 
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The plasmids for the expression of the complete TGF- a 
sequence as a short fusion was obtained as follows. pTE2 was cut at 
its unique PstI and Bgllll site and the larger fragment was 
isolated. The cDMA plasmid pTGF-Cl was cleaved with PstI (position 

5 223, Fig. 3) and Avail (position 301, Fig. 3) and the 78 bp TGF-a 

specific fragment was isolated. Both fragments were ligated in the 
presence of the partially complementary oligonucleotides 
dGACCTCCTGGCCTAA and dGATGTTAGGCCAGG AG . These oligomers introduce a 
stop behind the TGF-a coding sequence and connect the Avai l site 

10 with the Bglll site, The resulting plasmid is pTE5. Plasmid pTE6 
has the complete coding sequence for the first 190 amino acids of 
the trp LE fusion. It was made by ligation of the TGF-a specific 
520 bp long EcoR I -BamH I fragment into the large EcoRI-BamHI fragment 
of pHCV (51). The trp promoter controls the synthesis of the TGF-a 

15 short and long fusion proteins from plasmids pTE5 and pTE6, and the 
expression of the Tet gene. 

For the direct expression of the DNA sequence (Fig. 4d) for 
TGF-a, with its connecting downstream coding sequence, plasmid 
20 XAP-PA2 was cleaved at the unique Xbal and Bgll l sites and the large 
vector fragment was isolated. XAP-PA2 is a plasmid essentially 
similar to pFIF-trp69 (67) except that the IFN-b cDHA sequence is 
replaced by the cDNA sequence for human tissue plasminogen 
activator, modified as in plasmid pt-PA trpl2 (70), and that the 641 

n 

25 bp Aval-PvuI I fragment downstream of the Tet gene was deleted. 
In addition, the 70 bp Xbal and Bgll l fragment was isolated 
containing the start of the TGF-a sequence from pTEl and the 345 bp 
PstI-Sau3 AI fragment which contains the rest of the coding sequence 
from pTGF-Cl . The three fragments were ligated. The resulting 

30 plasmid, pTE4, contains the coding sequence preceded by a start 
codon, as an EcoR I -Bgll l fragment under the control of the trp 
promoter. 

In addition, to the plasmid pTE4 for direct expression, we 
35 also envisaged the expression of this coding sequence as a fusion 
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protein with the NH 2 -terminal part of the trp leader protein. The 
TGF-a specific Pstl-Bglll fragment of 350 bp long was therefore 
isolated from pTE4 and li gated into the large Pst-BglH fragment of 
plasmid pTE2. The resulting plasmid is called pTE7. Alternatively, 
the TGF-a sequence containing EcoR I-BamHI fragment of pTE4 was 
ligated into the large EcoR I -BamH I fragment of pTE3, giving rise to 
plasmid pTE8. Plasmids pTE7 and pTE8 contain the TGF-a DNA sequence 
and its downstream coding sequence linked to the first 17 or 190 
amino acids, respectively, of the trp LE 1413 fusion protein (ref. 
76). The ligation of these sequences is via an EcoRI site which is 
followed by the ATG codon for methionine. The presence of the 
methionine in front of the TGF-a sequence makes it possible to 
cleave the fusion protein specifically behind this residue. 

Restriction enzymes were purchased from New England Biolabs 
or Bethesda Research Laboratories. T4 kinase was from New England 
Nuclear, T4 DNA ligase was from Bethesda Research Laboratories, and 
E. coli DNA polymerase I (Klenow fragment) was from New England 
Nuclear or Boehringer Mannheim. All enzymes were used essentially 
as recommended by the manufacturers. 

T4 kinase reactions were performed in 70 mM Tris-Cl pH 7.6, 
10 mM MgCl 2 and 5mM dithiothrei to! . Ligations were in 20 mM 
Tris-Cl, pH 7.6, 50 mM NaCl , 6 mM MgCl 2 , 10 mM dithiothreitol , 0.5 
mM ATP. Restriction digestions were in 6 mM Tris-Hcl, 6 mM MgCl 2 , 
6 mM 8-mercaptoethanol , while the "fill -in" reactions using 
DNA-polymerase I (Klenow fragment) were done in the same buffer 
supplemented with 20 pM of each of the 4 dNTPs. 

Fig. 5 fives a schematic representation of plasmids pTE5 
and pTE6 and shows the amino acid sequence connecting TGF-a with the 
preceding trp LE fusion protein sequence. 

The introduction of these plasmids pTE5 and pTE6 into 
E. coli W3110 and the subsequent induction of the trp promoter 
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results in the synthesis of high levels of the TGF-a fusion 
proteins. The long fusion protein, expressed from plasmid pTE6, 
constitutes the major protein by far in a total bacterial lysate. 
The short fusion protein encoded by plasmid pTE5 is not synthesized 
in such high abundance but is nevertheless easily detectable as a 
prominent band in the JE. col i lysate (Fig. 6). 

Fig. 6 shows the results after electrophoresis in a SDS-13 
percent polyacryl amide gel (79) of total lysates of _E. coli 
containing the expression plasmids pTE2, pTE3, pTE5 or pTE6. 
IE. coli W3110, transformed with these plasmids was grown at 37°C in 
M9 medium containing casamino acids and tetracycline (5 ug/ml) to 
0D 550 = 0.1. Expression from the trp promoter was boosted by 
adding indolacetic acid to 20 ug/ml. Three ml of bacteria were 
harvested at 0D 55Q = 0.1 before induction, while a similar number 
of bacteria were collected at 00550 s 1-0 after induction in the 
case of pTE2 and pTE5 or OD 55Q = 0.7 for pTE3 and pTE6. The 
pelleted bacteria were resuspended in 30 ul 10 mM Tris-HCl pH 7.5, 1 
mM EDTA and 3 pi 1 H e-mercaptoethnol and 6 ill 10 percent SDS were 
added. The mixtures were heated for 2 mi n at 95*C and 300 \il cold 
acetone was added. The acetone precipitate was pelleted, dissolved 
in 25 i.1 SDS-loading buffer (5 percent 6-mercaptoethanol , 4 percent 
SDS, 0.125 M Tris-HCl, pH 6.8, 20 percent glycerol) and after 
heating (2 min. 95'C) loaded on an SDS-13 percent polyacryl amide 
gel. The gel was stained with Coomassie Brilliant Blue. Bacterial 
lysates before and after induction are shown. The arrow marks the 
TGF-a fusion protein. The position of the molecular weight markers 
is shown at the right of Figure 6. 

The short TGF-a fusion proteins f rom _E. coli transformed 
with pTE5 was substantially enriched in order to determine the 
biological activity. Two procedures, not involving column 
chromatography, resulted in a purity of 80 to 90 percent. One 
method is based on the observation that the short TGF-a fusion 
protein is, in contrast to many other proteins, apparently insoluble 
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1n the presence of 0-4 M NaCl and 0.5 percent NP40. The precipitate 
can then be solubilized in 8 M urea and dialyzed to 1 M acetic 
acid. The soluble fraction contains mostly the TGF-a fusion 
protein. Alternatively, the bacteria were sonicated in 70 percent 
ethanol acidified with HC1, and the TGF-a fusion protein was 
precipitated from the cleared supernatant by an ether-ethanol 
precipitation and dissolved in 1 M acetic acid essentially as 
described (18). Gel electrophoretic analysis showed that both 
procedures resulted in an equally efficient enrichment (Fig. 7). 

Figure 7 shows an SDS-polyacryl amide gel (79) of the 
bacterial short TGF-a fusion protein, enriched by the acid-ethanol 
method. The 68 amino acid long TGF-a fusion protein migrates as a 
broad band (arrow) in this gel. The enrichment of the protein by 
the NP40-NaCl method is very similar. 

The bacterial TGF-a short fusion protein thus obtained was 
tested in two different assays. It has been shown that natural 
TGF-a competes with EGF for the same receptor (5, 8, 9). This has 
led to the development of a fast and quantitative binding assay for 
TGF-a, based on the competition with 125 I -labelled EGF (5, 14, 
27). The results from binding experiments using NRK cells (15, 19, 
27) indicate unambiguously that the TGF-a short fusion protein, as 
well as the TGF-a generated by cyanogen bromide cleavage (not shown) 
from the same fusion protein, bind to the EGF-receptor. However, 
the binding of the short TGF-a fusion protein is only 0.5-1 
percent of the expected value, if one assumes quantitatively 
equivalent binding of TGF-a and EGF for the same receptor under the 
experimental conditions. It is possible that the low value may be 
due to an intrinsically lower binding affinity of human TGF-a, or to 
the presence of molecules with aberrant configurations or to the use 
of binding conditions which are not optimal for the bacterial TGF-a. 

The A panel of Figure 8 shows* the competition of the 
bacterial TGF-a short fusion with 125 I -label led EGF in a 
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radioreceptor assay (solid line). The calibration curve with EGF is 
shown as a dashed line. Panel B shows the EGF calibration curve 
separately, 

5 The biological activity of TGF-a can be measured by its 

ability to induce anchorage independence of non-transformed cells, 
such as NRK cells. The presence of TGF-a or EGF induces the 
formation of colonies. The number and size of these colonies is 
strongly increased in the presence of TGF-b (13-14). The bacterial 

1° TGF-a, purified as indicated before, was tested for the ability to 

induce anchorage independence, using NRK cells (140). Figure 9 also 
shows soft-agar colony-forming activity of murine EGF and the 
bacterial TGF-a fusion protein before and after cleavage with 
cyanogen bromide (75). The assay was performed in the presence of 

15 TGF-b with NRK cells, clone 49F, as described (14). The ordinate 
scores the number of colonies larger than 850 urn 2 , while the 
abscissa indicates the concentrations of EGF or bacterial TGF-a, 
expressed in EGF equivalents (ng/ml), as determined in the EGF 
receptor binding assay. The dashed line shows the curve for EGF, 

20 while the solid lines give the results for the bacterial TGF-a 
fusion protein, before (o) or after (o) cleavage with cyanogen 
bromide. These results clearly show that the TGF-a short fusion 
triggers colony formation in soft agar in the presence of TGF-b. It 
is remarkable that in these assays the bacterial TGF-a is about 20 

25 to 30 times more active relative to EGF than it is in the 

radioreceptor assay. This quantitative difference was also apparent 
with the cyanogen bromide cleaved TGF-a fusion protein. The number 
and the size of colonies, induced by the bacterial TGF-a, in the 
absence of TGF-b, is much smaller than in the presence of TGF-b, but 

30 also under these assay conditions the quantitative differences 
between the bacterial TGF-a and EGF remain unchanged. 

The results from both types of assays show that the 
bacterial TGF-a fusion protein, with its additional 17 
35 NH 2 - terminal amino acids, and the cyanogen bromide cleaved protein 
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compete with EGF and can induce anchorage independence of NRK 
cells- The ability to induce colony formation in soft agar is much 
higher than expected on the basis of the radioreceptor assay. In 
all cases TGF-a has 20 to 30 fold higher activity in soft agar than 
5 it has in the radioreceptor assay {relative to EGF). 

Massague et jfL (12) have presented evidence that TGF-a 
binds not only to the EGF receptor but also to a 60 kd TGF-a 
specific receptor. It is possible that the binding to this latter 

10 receptor medites in part the induction of the anchorage-dependent 
character by TGF- a . If so, the EGF radioreceptor assay may not be 
predictive in an absolute manner for the colony formation in soft 
agar. It could also be possible that the binding characteristics of 
TGF-a to either of these receptors are not the same. In 

15 contradiction to this, Carpenter et al_. (8) have recently shown that 
the induction of anchorage independence by TGF-a can be blocked by 
the presence of antisera to the EGF receptor, indicating that the 
binding to this receptor is required for the appearance of colonies 
in soft agar. It is also possible that the binding of the bacterial 

20 TGF-a to the EGF receptor is more efficient under conditions for the 
soft agar assay than during the radioreceptor assay, and that this 
may explain the quantitative differences observed between both 
assays. 

25 5. Expression of TGF-g in Yeast 

A plasmid was designed which was aimed at the expression of 
TGF-a in yeast and subsequent secretion into the yeast medium. For 
this purpose we explored the use of the gene coding for the mating 
factor-a in yeast. This a-factor is secreted from the yeast 

30 cerevisiae and plays an important physiological role in the mating 

process. The gene for this a-factor has been isolated and codes for 
a large precursor. The p rep ro-a-f actor polypeptide comprises an 
amino-terminal signal peptide needed during the secretion process, 
and 4 identical a-factor peptide units. Release of these a-factor 

35 peptides most likely involves a proteolytic cleavage of the 
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precursor at the dibasic Lys-Arg residues, located downstream of the 
signal peptide and in front of the a-factor units (J- Kurjan et al . , 
1982, "Cell" 30: 933; Singh et a^ , 1983, "Nucl. Acids Res." }U 
4049. A TGF-a expression plasid (pyTE2) was constructed in which 
the sequence coding for the 50 amino acid TGF-a and its flanking 
stop codon was introduced in frame immediately following the codons 
for the yeast a-factor Lys-Arg dipeptide. The sequence for the 
amino terminal part of the prepro-a-f actor including the signal 
peptide is retained. The expression of this TGF-a fusion 
polypeptide is under the control of the a-factor promoter. Suitable 
starting plasmids for this and equivalent constructions are 
described in EP 123,544A. Expression plasmid pyTE2 (Fig. 10), also 
contains a functional yeast replication origin derived from the 2m 
plasmid {J. Hart! ey et aK , 1980, "Nature" 286: 860), a 
transcriptional terminator and polyadenylation site from the "Able" 
gene (J. Hartley et a]_., idj and the TRP-1 selection marker (G. 
Tschumper et al., 1980 "Gene" U): 157-166). 

Saccharomyces cerevisiae strain 20B-112 (E. Jones, 1976, 
"Genetics" 85: 23) was transformed with pyTE2. The medium of the 
transformed yeast was assayed both in the radioreceptor assay and in 
the soft agar colony formation. Using both assays, biologically 
active TGF-a could be detected in the yeast medium. Further 
analysis showed that about 8 ng of TGF-a can be recovered per ml of 
medium and that following the synthesis in yeast more than 90 
percent of the biologically active TGF-a is secreted. It is likely 
that the secreted TGF-a has the proper disulfide bond configuration, 
since refolding the disulfide bridges is not needed for activity. 
Using a similar a-factor based expression vector, Brake et al . , 
1984, "Proc. Natl. Acad. Sci. USA" 81^: 4542, have recently been able 
to express and secrete human EGF from yeast. 

6. Expression of a TGF-gC Polypeptide 

pTE4 was digested with Bgll and BamHI and the 
TGF-aC-containing fragment recovered. An oligonucleotide primer 



2846Y 



- 47 - 



PI 54434 



having the sequence 

MetllelleThrCysValLeu 
G ATC G* AATTC ATG ATC ATC AC ATG TG TG C TG 

EcoRI 

was prepared, representing an EcoRI site, a methionine codon and the 
first six residues of a TGF-aC polypeptide starting at He 115 of 
the precursor. The regions in the TGF-aC-containing fragment which 
were 5 1 to this primer were deleted by primer extension following 
conventional procedures. The DNA encoding the TGF-aC polypeptide 
was recovered by gel electrophoresis of an EcoRI and BglH digest. 
Alternatively, this DNA could be prepared by organic synthesis. 

pTE5 was digested with EcoRI and Bglll , and the large 
vector fragment recovered. The recovered pTE5 fragment and the 
EcoRI -BglU TGF-aC fragment from the previous steps were ligated 
with T4 ligase, the mixture used to transform E. coli and the 
bacteria cultured as described above. TGF-aC was relatively toxic 
to E. coli * Better yields might be obtained by modifying the 
oligonucleotide so that the E. coli STII or alkaline phosphatase 
signals are inserted in place of the ATG start codon, or by 
expressing the TGF-aC gene in a mammalian cell transformation 
host-vector system. 
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CLAIMS 

1. A bandage impregnated with a product comprising human 
mature TGF-a, TGF-aC, TGF-aN or TGF-amC essentially free of 

5 other human proteins. 

2. A composition comprising precursor TGF-a or a fragment 
thereof produced by heterologous recombinant host cells. 

10 3. A composition of claim 2 wherein the precursor of 
TGF-a or its fragment is a mutant. 

4. A composition of claim 2 conjugated to a heterologous 
polypeptide which is heterologous to the organism from which 

15 was obtained the recombinant DNA encoding the precursor 

TGF-a or its fragment, the heterologous polypeptide being a 
microbial s ignal sequence . 

5. A composition of claim 2 wherein the precursor TGF-a 
2 0 or its fragment is human and additionally comprising a 

predetermined amount of a human protein. 

6. A composition of claim 2 which is labelled with a 
detectable moiety. 

25 

7. A composition of claim 2 which further comprises a 
pharmaceutical^ acceptable carrier and a therapeutically 
effective amount of precursor TGF-a or its fragment. 

30 8. A composition of claim 7 which is sterile and which 
further comprises a predetermined amount of TGF-6. 

9. A composition of claim 8 for topical application to a 
wound . 
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10. A DNA sequence encoding precursor TGF-a or a fragment 
thereof, provided that when the fragment is mature TGF-a 
then the mature TGF-a is human. 

5 11« The DNA sequence of claim 10 ligated to DNA encoding a 
heterologous polypeptide. 

12. A replicable vector comprising the DNA sequence of 
claim 10 or claim 11. 

10 

13. A cell transformed with the vector of claim 12. 

14. A method for producing a polypeptide comprising TGF-a 
precursor or a fragment thereof, comprising: 

15 (a) preparing an expression vector capable of acting 

in concert with a host cell to express a DNA sequence 
encoding said polypeptide? 

(b) transforming said host cell with the expression 
vector to obtain a recombinant host cell? 
20 (c) culturing the recombinant host cell under condi- 

tions for expression of the product; and 

(d) recovering the product from the host cell culture. 

15. A diagnostic method comprising assaying a body tissue 
25 or fluid for the presence of TGF-amC or TGF-aC. 

16. A diagnostic method. for determining TGF-amC 
comprising: 

(a) contacting a body fluid sample with an antibody 
30 capable of binding the TGF-aC domain of TGF-amC? 

(b) contacting the body fluid sample with an antibody 
capable of binding the mature TGF-a domain of TGF-amC; 

(c) labelling the antibody of parts (a) or (b) with a 
detectable moiety; 

35 (d) insolubilizing the antibody which is not labelled; 
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(e) separating the soluble and insoluble phases; and 

(f ) measuring the distribution of label between the 
phases . 

5 17. An antibody raised against a predetermined amino acid 
sequence within a PRTGF-a species. 

18. An antibody according to claim 17 for administration 
to an animal . 

10 

19. A sterile therapeutic composition comprising a 
pharmaceutically acceptable carrier and a therapeutically 
effective dose of a TGF-a binding agent for inhibiting the 
activation of cell surface receptors by TGF-a. 

15 
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